PROTECTION-OF-TELOMERE-1 (POT-1) PROTEIN AND 
ENCODING POLYNUCLEOTIDES 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application is a continuation of U.S. Patent Application Serial No. 

09/816,248, filed March 26, 2001, and entitled "Protection-of-Telomere-1 (POT-1) 
Protein and Encoding Polynucleotides." The entire disclosure of U.S. Patent 
Application Serial No. 09/816,248 is incorporated herein by reference . 

BACKGROUND OF THE INVENTION 

Telomeres are the protein-DNA complexes that protect the ends of linear 
eukaryotic chromosomes from degradation, prevent end-to-end fusions and partake in 
chromosome localization and segregation (Cooper, Curr Opin Genet Dev 10: 169-77, 
2000; McEachern et aL, Annu Rev Genet 34: 331-358, 2000; Price, Curr Opin Genet 
Dev 9: 218-24, 1999). Telomere length, 15-20 kb in human embryonic or germ line 
cells, is maintained in part by the enzyme telomerase. In the absence of telomerase 
activity, about 50-200 bases of DNA are not replicated with each round of cell 
division, resulting in the eventual diminution in telomere size to typically 5-7 kb. At 
that length, cells enter a state of arrested growth called replicative senescence. The 
maintenance of telomere length thus is believed to play a key role in the ability of 
cells to avoid replicative senescence and to propagate indefinitely, as is the case with 
stem cells. Likewise, aberrant maintenance of telomere length is believed to underlie 
indefinite cellular proliferation characteristic of cancer cells (Bodnar et aL , Science 
279: 349-352, 1998; Bryan et ah, 1997; McEachern et aL , 2000). 

Telomeres consist of repeating units of GC-rich DNA and terminate in a 
single stranded extension of the 3' strand. Oxytricha nova telomeres, for example, 
consist of tandem repeats of (TTTTGGGG) and end with a 16 nucleotide overhang of 
the G-rich strand. By contrast, human telomeres have a repeating sequence 
(TTAGGG)n and end with a 50-100 nucleotide overhang of the G-rich strand. 
McEachern et aL, 2000. 

A number of proteins have been identified that specifically interact with the 
double-stranded portion of the telomere or the single-stranded 3' extension at its very 



end. Among the most well characterized are the telomere end -binding proteins from 
hypotrichous ciliated protozoa (Gottschling et aL, Cell 47: 195-205, 1986; Price et 
aL, Genes Dev 1: 783-93, 1987). The a and p subunit of the 0. nova Telomere 
End-Binding Protein (TEBP) bind specifically to the 16 nucleotide single-stranded 
extension at the ends of macronuclear chromosomes (Gray et aL, Cell 67: 807-14, 
1991) and form a ternary complex whose structure has been determined using X-ray 
crystallography (Horvath et aL, Cell 95: 963-974, 1998). Although both protein 
subunits directly interact with DNA in the ternary complex, only a binds telomeric 
DNA by itself (Fang et aL, Genes Dev 7: 870-82, 1993). The DNA binding domain 
in the a subunit has been mapped to the N-terminal two-thirds of the polypeptide 
(Fang et aL, 1993) and is comprised of two "OB folds" (Horvath et aL, 1998). In 
vitro reconstituted a-DNA complexes are substrates for telomerase, whereas ot-p- 
DNA complexes are not; an observation which may indicate a function in the 
regulation of telomere length (Froelich-Ammon et aL, Genes Dev 12: 1504-14, 
1998). 

The protrusion of the G-rich strand as a single-stranded overhang is 
conserved between ciliates (Klobutcher et aL, Proc Natl Acad Sci U S A 78: 3015- 
19, 1981), yeast (Wellinger et aL, Cell 72: 51-60, 1993) and mammalian cells 
(Makarov et aL, Cell 88: 657-66, 1997; McEUigott et aL, Embo J 16: 3705-14, 
1997; Wright et aL, Genes Dev 11: 2801-09, 1997), suggesting the existence of 
similar functional mechanisms in telomere maintenance. However, proteins sharing 
sequence homology with ciliate TEBPs were not identified in the complete 5. 
cerevisiae genome or among the proteins that bind single-stranded telomeric DNA in 
vitro. Similarly, the S. cerevisiae single-stranded telomeric DNA-binding protein 
cdcl3p has not been proposed to be homologous to the ciliate TEBPs, nor have 
cdcl3p homologues been identified in distantly related species. (Ishikawa et aL, Mol 
Cell Biol 13: 4301-10, 1993; Lin et aL, Proc Natl Acad Sci U SA 93: 13760-65, 
1996; McKay et aL, Nucleic Acids Res 20: 6461-64, 1992; Nugent et aL, Science 
274: 249-52, 1996; Virta-Pearlman et aL , Genes Dev 10: 3094-104, 1996). 
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The apparent absence of specific end-capping proteins in some eukaryotes has 
been explained by the adoption of a telomere structure distinct from that found in the 
macronuclei of hypotrichous ciliates. This telomere structure, found at the ends of 
mammalian and O. fallax chromosomes, is a large duplex loop, or "t loop," created 
by the sequestration of the single-strand overhang within the double-stranded portion 
of the telomeric tract (Griffith et aL y Cell 97: 503-14, 1999; Murti et ai y Proc Natl 
Acad Sci USA 96: 14436-39, 1999). In mammals, this architecture is believed to be 
maintained by a number of proteins, including the TTAGGG-binding factors, TRF1 
and TRF2. TRF2 is believed to catalyze the sequestration of the single-stranded 
DNA into the duplex region of the DNA. Consistent with this notion is the 
observation that TRF2 can cause telomeric DNA to form t loops in vitro (Griffith et 
al. % 1999). Other proteins have been implicated in telomere architecture and 
regulation, including TIN2, which was identified by its ability to interact with TRF1 
(Kim etai, 1999). 

The ability to manipulate telomere structure and metabolism depends on the 
identification of those components required for the regulation of telomere structure. 
Evidence has accumulated that telomerase activity itself is not determinative of 
telomere elongation or replication. For example, some cancer cell lines maintain 
telomeres in the absence of telomerase activity (Bryan et a/., 1997). There is thus a 
pressing need in the art to identify the functional components that regulate telomere 
metabolism, to identify compounds that can be used to control the entry, avoidance, 
or exit of a cell from a state of replicative senescence. Such compounds may be 
useful alternatively in allowing the indefinite propagation of useful cell lines or in 
halting the growth of cancer cells in vivo for therapeutic purposes. 

SUMMARY OF THE INVENTION 

The present invention addresses this need by providing a protein that caps the 
very ends of human chromosomes, and a related protein that caps the ends of 
chromosomes in fission yeast (Schizosaccharomyces pombe). The protein of the 
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invention is termed "Protection of Telomere-1," or "Potlp, " or "Potl protein." 
Specific embodiments of these proteins are those isolated from humans and fission 
yeast, hPotlp and SpPotlp, respectively. Polynucleotides encoding a Potl protein 
are also provided. 

The inventors have found that Potlp binds single-stranded telomeric DNA, 
which is a unforeseen finding, given the apparent absence of end-capping proteins in 
some eukaryotes. Potlp both stabilizes chromosome ends and regulates telomerase 
activity. Accordingly, compounds that stabilize or disrupt the Potlp-DNA 
interaction will be useful in regulating the telomere length of a target cell or cell 
population. The invention thus provides a means of altering cellular life-span, for 
the purpose of either prolonging the life-span of useful cell populations or making 
cancer cells enter replicative quiescence. Useful compounds with these properties 
can be identified through screening methods made possible by the discovery that a 
Potl protein binds single-stranded telomeric DNA. The identification of a Potl 
protein and its encoding DNA also provides a means of developing tools to diagnose 
illnesses such as cancer that may involve altered expression or structure of a Potl 
protein or gene. Such tools include polynucleotide hybridization probes and 
antibodies specific for a Potl protein. 

Accordingly, the invention provides isolated Potl proteins having the 
sequence set forth in SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID 
NO:9, or SEQ ID NO:ll. Variants of these proteins are capable of binding single- 
stranded telomeric DNA and have at least 85% sequence identity with, or differ by 
no more than about 20 single amino acid substitutions, deletions or insertions from, a 
sequence set forth in SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID 
NO:9, or SEQ ID NO: 11. The invention also provides an isolated, naturally 
occurring, variant of a protein having the sequence set forth in SEQ ID NO: 13 or in 
SEQ ID NO:9, which may be a splicing variant. Fragments of the Potl proteins of 
the invention are capable of binding single-stranded telomeric DNA, and comprise 
the polypeptide having the sequence set forth in SEQ ID NO:5 or SEQ ID NO:6. 
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The invention further provides an isolated non-genomic polynucleotide 
encoding one of the aforementioned proteins. A vector comprising such a 
polynucleotide and a host cell comprising the vector also are provided. The 
polynucleotide may be included in a pharmaceutical composition, along a 
pharmacologically acceptable excipient, diluent, or carrier. A method of detecting 
or measuring the presence of a POT1 polynucleotide comprises contacting the a 
POT1 polynucleotide, or its complement, with a biological sample from an 
individual. 

An antibody, or a fragment or variant thereof, is provided, which is capable 
of binding a Potl protein. A method of raising the antibody comprises isolating the 
antibody from an animal or isolating an antibody-producing cell from an animal, 
following administration of a Potl protein, or an antigenic fragment thereof, to the 
animal. An antibody of the invention may be useful in detecting or measuring the 
presence of a Potl polypeptide in an individual, by contacting the antibody with a 
biological sample from an individual. 

The invention provides a method of increasing the life-span of a cell, by 
inserting a vector comprising a POT1 polynucleotide into the cell, where the POT1 
polynucleotide is operably linked to a promoter that allows the polynucleotide to be 
transcribed. The vector comprising a POT1 polynucleotide may be administered to 
an individual in a pharmaceutical composition, comprising the polynucleotide and a 
pharmacologically acceptable excipient, diluent, or carrier. In one embodiment, the 
carrier is capable of preferentially delivering the polynucleotide to a specific cell 
population. In another embodiment, the vector comprising the POT1 polynucleotide 
is inserted into the cell in vitro, which then may be subsequently administered to an 
individual. The target cell may express a second polynucleotide that encodes an 
exogenous protein, such as a therapeutically useful protein. 

A method of identifying a compound that interferes with the binding of a Potl 
polypeptide to single-stranded telomeric DNA comprises determining whether the 
candidate compound decreases the binding of the Potl polypeptide to a single - 
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stranded telomeric DNA molecule in a mixture of the single-stranded telomeric DNA 
molecule, the polypeptide, and the candidate compound. The compound identified 
by this method may be formulated in a pharmaceutical composition. 

A method of decreasing the life-span of a cell comprises reducing the level of 
Potlp activity in a cell. The cell may be an immortal cell line, such as a cancer cell. 
In one embodiment, the method comprises delivering one of the compounds that 
interferes with the binding of a Potl polypeptide to single-stranded telomeric DNA. 

BRIEF DESCRIPTION OF THE FIGURES 
FIGURE 1A: Multiple sequence alignments of the N-terminal regions of yeast and 
human Potlp and the a subunits of ciliate TEBPs (Ec, Euplotes crassus [SEQ ID 
NO:l]; Sm, Stylonychia mytilis [SEQ ID NO:2]; Ot, Oxytricha trifallax [SEQ ID 
NO:3]; On, Oxytricha nova [SEQ ID NO:4]; Hs, Homo sapiens [SEQ ID NO:5]; Sp, 
Schizosaccharomyces pombe [SEQ ID NO: 6]). The numbers of the first and last 
amino acid shown are depicted at the beginning and end of each sequence. 
Sequences were aligned in ClustalW using the Blosum35 score table followed by 
manual adjustment. Shaded amino acids are conserved in 4 or more sequences. 
FIGURE IB: Domain structure of the O. nova TEBP and yeast and human Potlp. 
Position of OB folds (Horvath et al. 9 Cell 95: 963-74, 1998) and functional domains 
(Fang et al. 3 Genes Dev 7: 870-82, 1993) are depicted for the O. nova TEBP. The 
position of the regions aligned in FIGURE 1 A are indicated by open boxes. 
FIGURE 1C: Morphological phenotype associated with deletion of potl*. Colony 
morphology of potl*, potl\ ml* and trtl' following tetrad dissection and 
germination. 

FIGURE ID: Phase contrast micrographs of potl* and potl' cells 5 to 10 
generations after germination. 

FIGURE IE: Cells as in FIGURE ID but stained with DAPI to reveal chromosome 
segregation defect in potl\ 
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FIGURE 2A: Telomere phenotype in poll strains. Genomic DNA from the 
indicated diploid and haploid strains was digested with Eco RI, which cleaves S. 
pombe DNA about 1.0-1.2 kb from the chromosome ends, and then fractionated by 
1.1 % agarose gel electrophoresis, transferred to a nylon membrane and hybridized to 
a telomeric probe. A probe against the single-copy pola gene was used as a loading 
control. 

FIGURE 2B: Genomic DNA was digested with Afr/I, fractionated by 0.8% agarose 
gel electrophoresis, transferred to a nylon membrane and hybridized to a probe 
against Telomere Associated Sequences internal to the telomere itself (TAS2 
sequences). 

FIGURE 2C: The blot shown in FIGURE 2B was stripped and hybridized to a probe 
against Telomere Associated Sequences that are internal to TAS2 (TAS3 sequences). 
FIGURE 3A: DNA-binding specificity of S. pombe Potlp, using conditions 
described in the Examples. SpPotlp was incubated with the indicated DNA 
substrates. Complexes were analyzed by nondenaturing gel electrophoresis. The 
SpPotlp-DNA complex is indicated by an open arrow. 

FIGURE 3B: Same as FIGURE 3A except that the added protein contained truncated 
Potlp as well as full length protein. Truncated Potlp-DNA complex is indicated by 
a closed arrow. 

FIGURE 4A: Expression of hPOTl and DNA-binding. RT-PCR amplification of 
GAPDH and hPOTl mRNA in various human tissues. 

FIGURE 4B: Binding of hPotlp to human C-strand (SEP ID NO: 19) (CCCTAA).. 
G-strand (SEP ID NO: 20) (TTAGGG). and duplex (SEP ID NO: 21) 
(CCCTAA) 5 # (TTAGGG) 5 . Binding conditions and analysis were as described in 
FIGURE 3. 

FIGURE 5A: Substrate specificity of 5. pombe and human Potlp. Binding of 
SpPotlp to 5. pombe and human G-strand DNAs. 
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FIGURE 5B: Binding of SpPotlp (50 ng) to radiolabeled S. pombe G-strand (1.5 
fmol, or 1 ng) in the presence of 10-, 100-, and 1000-fold excess of unlabeled 
competitor S. pombe, human or O. nova G-strand DNAs. 
FIGURE 5C: Binding of hPotlp to S. pombe and human G-strand DNAs. 
FIGURE 5D: Binding of hPotlp to human G-strand DNAs under same conditions as 
in FIGURE 5B. 

FIGURE 6: Inhibition of telomerase activity by Potlp. Telomerase activity is 
assayed with telomeric primer PBoli82 (SEP ID NO: 22) 
(TGTGGTGTGTGGGTGTGC) as described in Haering et al., Proc. Nat 'I Acad. Sci. 
USA 97: 6367-72, 2000. Unlabeled nucleotides are added to a concentration of 100 
uM as follows: lanes a and b, dATP, dCTP and dTTP; lanes c and d, ddATP, dCTP 
and dTTP; lanes e and f, dATP, dCTP and ddTTP. For lanes b, d, and f the 
oligonucleotide was preincubated with a SpPotlp preparation containing full length 
protein and the N-terminal 22 kDa fragment (100 ng/ul). The Potl protein inhibits 
primer extension by telomerase. 

FIGURE 7: S. pombe POT J genomic DNA. The sequence shown (SEQ ID NO: 7) is 
published by the Sanger Centre as part of cosmid c26H5, having accession number 
SPAC26H5. The sequence contains an upstream promoter sequence, a coding 
sequence, which includes two introns, 1 and 2, and a downstream terminator 
sequence. 

FIGURE 8A: A S. pombe POT1 cDNA sequence (SEQ ID NO: 8), in which both 
introns 1 and 2 have been spliced out. 

FIGURE 8B: A SpPotl protein (SEQ ID NO:9) encoded by the DNA sequence of 
SEQ ID NO:8. 

FIGURE 8C: A splicing variant of the S. pombe POT1 cDNA sequence of SEQ ID 
NO:8, in which intron 2 has not been spliced out (SEQ ID NO: 10). 
FIGURE 8D: The SpPotl polypeptide (SEQ ID NO: 11) encoded by the splicing 
variant of SEQ ID NO: 10. 

FIGURE 9A: A full-length hPOTl cDNA (SEQ ID NO: 12). 
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FIGURE 9B: The hPotlp splicing variant (SEQ ID NO: 13) encoded by the 
polynucleotide of SEQ ID NO: 12. 

FIGURE 9C: Another splicing variant of hPOTl cDNA (SEQ ID NO: 14), having an 
inserted exon indicated by the underlined residues. 

FIGURE 9D: The hPotlp splicing variant (SEQ ID NO: 15) encoded by the 
polynucleotide of SEQ ID NO: 14. The alternatively spliced exon gives rise to a 
protein that is about 50% shorter than full-length hPOTlp and has an alternative C- 
terminus. 

FIGURE 9E: A splicing variant of hPOTl cDNA (SEQ ID NO: 16). An exon is 

skipped, giving raise to a hPotlp with an alternate C-terminus. 

FIGURE 9F: The hPotlp splicing variant (SEQ ID NO: 17) encoded by SEQ ID NO: 

16. 

FIGURE 10A-F: A partial genomic clone of hPOTl (AC004925; SEQ ID NO: 18). 
Exons are in capital letters. 

FIGURE 10G: A scale diagram of SEQ ID NO: 18, showing the relative position of 
exons. Exons are numbered arbitrarily, because the clone does not extend to the 5' 
end of the gene. The exons present in the splicing variants of FIGURE 9 are 
indicated. "Spice variant #1" corresponds to SEQ DI NO: 13, "Splice variant #3" 
corresponds to SEQ ID NO: 15, and "Splice variant #3" is SEQ ID NO: 17. 

DETAILED DESCRIPTION 
OF THE PREFERRED EMBODIMENTS 

The inventors provide a method to control the life-span of a cell. The life- 
span of a cell depends in part on the ability of a cell to replicate its telomeres with 
each round of cell division. A Potl protein stabilizes chromosomes by binding the 
single-strand G-rich 3' extension in the telomere, thereby avoiding loss of telomeric 
DNA and concomitant chromosome fusion or degradation. In the presence of a 
telomere replication mechanism, such as telomerase or telomeric recombination, 
Potlp allows cells to undergo repeated division without reduction in the length of the 
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telomere and attainment of replicative senescence. The isolation of a Potl protein 
and its encoding polynucleotide allows a method of screening for compounds that 
affect the interaction between Potlp and telomeric DNA. These compounds will be 
useful in prolonging or reducing the life span of a cell or population of cells. 

The existence of end-capping proteins in humans and 5. pombe was 
unforeseen, given the previous inability to find such proteins. The inventors found 
that 5. pombe open reading frame SPAC26H5.06 contains a region of modest 
sequence similarity to the a subunits of TEBP from Oxytricha nova and other ciliates 
(FIGURE 1A). Conservation is most apparent over a 95 amino acid stretch near the 
N-termini of the proteins where the 5. pombe and 0. nova sequences share 19% 
identity and 40% similarity. This region coincides with the most highly conserved 
domain within the ciliate sequences (42% amino acid identity [61% similarity] 
between O. nova and E. crassus). Sequence alignments of hPotlp with the S. pombe 
protein reveals the highest conservation near the N-terminus where the S. pombe and 
human proteins share 48% similarity (26% identity) (FIGURE 1A). Over the same 
region, the similarity of the human sequence with the O. nova protein is 39% (23% 
identity). Such levels of similarity and identity are often found between functionally 
unrelated proteins, so they are insufficient to indicate homology; therefore, tests of 
function were performed. No obvious sequence similarity by primary sequence 
alignment is noted between hPotlp or SpPotlp and cdcl3p, the single-stranded 
telomeric DNA-binding protein of S. cerevisiae. 



Potl proteins prevent chromosomal instability. 

The inventors demonstrate by gene knock-out a role of the S. pombe gene, 
potl*, in telomere maintenance. A heterozygous diploid potl + /potl S. pombe was 
constructed by the method described in Baumann and Cech, Mol Biol Cell 1 1 : 3265- 
75, 2000. Tetrad dissections revealed that the potl' daughters formed only very 
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small colonies compared to their potl* sisters (FIGURE 1C). This immediate 
phenotype is in stark contrast to the observations made with strains lacking the 
catalytic subunit of telomerase (trtl), which form wild-type sized colonies upon 
sporulation (FIGURE 1C) and only begin to show a growth defect on the third re- 
streak, when telomeres have shortened considerably (Nakamura et al., Science 282: 
493-96, 1998). For approximately 10 generations after sporulation, pot J' colonies 
contained a large number of elongated cells, most of which failed to undergo further 
divisions (FIGURE ID). DAPI staining revealed a high incidence of chromosome 
missegregation, often leading to daughter cells without any chromosomal DNA 
(FIGURE IE). 

By deleting the S. pombe potl* gene, the inventors have shown that a Potl 
protein plays a pivotal role in preventing instability of chromosome ends in vivo. 
Biochemical and structural data have suggested a role for the Euplotes and Oxytricha 
TEBPs in protecting the very ends of chromosomes; however, because these 
organisms are not amenable to genetic studies, proof of such a capping function in 
vivo has been lacking. This proof is now provided by deletion of the potl + gene, 
which leads to immediate chromosome instability (FIGURE 2). Telomeres could not 
be detected by Southern blotting of genomic DNA from potl' strains (FIGURE 2A). 
Using three DNA probes that recognize distinct subregions of the telomere associated 
sequence (TAS), hybridization signals were only observed with the telomere distal 
TAS3 probe (FIGURE 2C), but not with TAS1 or TAS2 (FIGURE 2B and data not 
shown). These results indicate that around 5 kb of terminal sequence had been lost 
within -30 generations after loss of potl*. 

In contrast to the immediate chromosome instability caused by an absence of 
functional SpPotlp, the absence of functional telomerase causes gradual telomere 
shortening over many generations without an immediate effect on chromosome 
stability and cell viability (Nakamura et al. 9 1998). Thus, at least in 5. pombe, Potlp 
apparently is more important for telomere maintenance in the short term than 
telomerase. 
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Potl proteins specifically bind single-stranded telomeric DNA. 

Potl proteins bind directly to single-stranded telomeric DNA. The SpPotl 
protein was expressed and purified from E. coli, using methodology described 
below, and the ability of the expressed protein to bind DNA was assayed using an 
electrophoretic mobility shift assay. SpPotlp interacts specifically with the G-rich 
strand of 5. pombe telomeric DNA, but not with the complementary C-rich strand or 
double-stranded telomeric DNA (FIGURE 3A). 

N-terminal fragments of the SpPotl protein maintain the ability to bind single- 
stranded telomeric DNA. Several truncated forms co-eluted with the full length 
protein from the Ni-NTA column used to purify the expressed SpPotl protein. 
These polypeptides retain the N-terminal His 6 tag and thus are believed to arise either 
from premature termination or from proteolytic degradation of SpPotlp. These 
truncated proteins had a higher affinity for DNA while retaining the same specificity 
as displayed by the full length protein (FIGURE 3B). Titration experiments 
indicated that the apparent for binding of a predominant N-terminal fragment of 
Potlp to the G-rich oligo is approximately 10 fold higher than for the full length 
protein (10 nM versus 100 nM). Further purification and analysis by mass 
spectroscopy showed that the strong shift (indicated by a closed arrow in FIGURE 
3B, lane d) is attributable to the binding of a 22 kDa N-terminal fragment of 
SpPotlp. Increased DNA binding likewise has been observed with N-terminal 
fragments of the a subunit of TEBP from Oxytricha nova (Fang et al. y 1993). 

hPotlp N-terminal fragments show the same behavior as SpPotlp fragments. 
hPotlp, like SpPotlp, often lacks C-terminal sequences due to degradation or pre- 
mature termination. These truncated forms of hPotlp also show the same DNA 
binding specificity as full length hPotlp obtained from in vitro translation reactions. 
In gel shift assays, hPotlp binds G-rich strands of human telomeric DNA (FIGURE 
4B). As with SpPotlp, binding was not observed with the complementary C-rich 
strand or with double-stranded telomeric DNA. 
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SpPotlp and hPotlp both bind specifically to telomeric DNA. That is, 
binding of both SpPotlp and hPotlp was unaffected by the presence of a 60-fold 
excess of herring sperm DNA and 2000-fold excess of an oligonucleotide of non- 
telomeric sequence. To further investigate the sequence specificity, G-rich strands of 
telomeric DNA from different species were tested as substrates in DNA-binding 
assays. In a side-by-side comparison, SpPotlp bound the human telomeric sequence 
(GGGTTA repeat) with a lower affinity than the 5. pombe telomeric sequence 
(repeating units of the consensus sequence GGTTACA) (FIGURE 5A). In 
competition experiments, a 1000-fold excess of unlabeled 5. pombe sequence 
abolished binding to the radiolabeled substrate, whereas the human and 0. nova 
DNA competitors reduced binding by only -50% and < 2%, respectively (FIGURE 
5B). Similarly, hPotlp showed only weak binding to the 5. pombe sequence 
(FIGURE 5C), which also was not an efficient competitor (FIGURE 5D). In 
contrast, the presence of a 1000- fold excess of the O. nova sequence reduced binding 
to less than 25%. Accordingly, both SpPotlp and hPotlp specifically bind telomeric 
DNA, and each shows a higher affinity for telomeric DNA from their own species. 

Potlp binds a variety of related telomeric DNA sequences. Oligonucleotides 
that form a DNA-Potlp complex, as determined by an electrophoretic mobility shift 
assay, are shown in Table I, below. The affinity between Potlp and the 
oligonucleotide varies with the particular sequence (data not shown). 



TABLE I 

SpPotlp-binding oligonucleotides: 

(SEO ID NOS:23-35, respectively, in order of appearance) 



PBoli52 


GGT 


TAC 


GGT 


TAC 


AGG 


TTA 


CA 


PBoli53 


CGG 


TTA 


CAC 


GGT 


TAC 


AGG 


T 


PBoli54 


GTT 


ACA 


GGT 


TAC 


GGT 


TAC 


GG 
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PBoli86 


TGT GGT GTG TGG GTG TGC GGT T 


PBolillO 


GGT TAC ACG GTT ACA GGT TAC AGG TTA CAG 


PBoIill2 


GGT TAC ACG GTT ACA GGT TAC AGG TTA CAG GGT TAC 
GGT TAC G 


PBolil83 


CTG TAA GCA TAT CAT CAT TCG A GGT TAC 


PBolil84 


GGT TAC GCA TAT CAT CAT TCG A ATC TCG 


PBolil85 


CTG TAA GCA TAT CAT CGG TTA CGG TTA C 


PBolil86 


GGT TAC GGT TAC CAT CAT TCG A ATC TCG 


PBolil87 


CTG TAA GCA TAT GGT TAC TCG A ATC TCG 


PB0HI88 


CTG TAA GC GGT TAC GGT TAC GA ATC TCG 


PT1 


GGT TAC AGG TTA CAG GTT AC 


hPotlp-binding oligonucleotides: 

(SEO ID NOS:36-38, respectively, in order of appearance) 




PBolil77 


TTA GGG TTA GGG TTA GGG TT 


- PBolil78 


GG TTA GGG TTA GGG TTA GGG 


PBolil79 


TTA GGG TTA GGG TTA GGG TTA GGG TTA GGG 



hPOTl mRNA is detected in all tissues examined, although a high steady- 
state level of hPOTl mRNA is observed in testis and lower levels are observed in 
colon, skeletal muscle, and peripheral blood lymphocytes (FIGURE 5 A and data not 
shown). In contrast with mRNA levels of human TERT, which correlate with 
cellular immortality and proliferative activity, the presence of hPOTl mRNA in all 
tissues examined is consistent with hPOTl being a house keeping gene, required to 
ensure the integrity of chromosome ends independently of the proliferative state of 
cells. 

Screening methods to identify useful compounds that affect the interaction of Potlp 
with single-stranded telomeric DNA. 

The use of routine screens to find inhibitors and activators of Potlp is 
facilitated by providing a polynucleotide that encodes a Potl protein, which allows 
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Potlp to be expressed recombinantly. Potlp thus may be expressed in vitro or in a 
host cell, such as E. coli, yeast, or bacullovirus-infected insect cells, and tested 
against candidate compounds. Useful compounds will be those that affect the binding 
between a Potl polypeptide and telomeric DNA, especially the G-rich single-stranded 
component. 

The interaction between Potlp and telomeric DNA is readily assayed in vitro, 
by a number of routine methods that are well known to the artisan. In vitro assays 
can be configured as high throughput assays, to test candidate molecules 
simultaneously. In one embodiment, such assays can be designed around the 
electrophoretic mobility shift assays described in the examples. 

Candidate molecules that will be useful for the invention generally will 
include small organic compounds that interact with a Potl protein or a Potl protein- 
DNA complex to change the binding constant. In one embodiment, candidate 
molecules are rapidly identified by their ability to change the amount of labeled 
probe that interacts with a Potl protein in vitro. Candidates with possible activity are 
then further analyzed to determine an apparent binding constant, which is compared 
to that of the control reaction lacking a candidate molecule, to determine whether the 
particular compound strengthens or weakens the interaction between Potlp and the 
telomere. Promising candidates may be subsequently analyzed in a cell culture 
system, to analyze the effect of the candidate molecule on telomere length or 
integrity throughout repeated cell divisions. The examples describe a number of tests 
that can be used to assay the role of Potlp on telomere structure. 

Likely candidate compounds that will inhibit the interaction between a Potl 
polypeptide include compounds that can act as a substrate analogue. Since the 
substrate for a Potl protein is telomeric DNA, such compounds include single- 
stranded DNA comprising TTAGGG repeats, when used to inhibit a hPotl protein or 
single-stranded DNA comprising GGTTACA repeats, when used to inhibit a SpPotl 
protein. FIGURE 5B, lanes d-e, and FIGURE 5D, lanes g-h and k provide in vitro 
proof of principle of the efficacy of such inhibitors. The oligonucleotides listed in 
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TABLE I represent a variety of useful compounds with a known ability to act as 
substrate analogues. Thus, these oligonucleotides themselves, or analogues of these 
oligonucleotides with advantageous pharmacological properties, will be useful 
compounds for the inhibition of Potlp activity. 

Preferred analogues of these oligonucleotides are non-hydrolyzable DNA 
analogues that have increased pharmacological longevity and efficacy. One DNA 
analogue with enhanced stability relative to DNA is a peptide nucleic acid (PNA) 
molecule that comprises a Potl protein binding site. Such molecules, along with 
methods of their formulation and delivery, are generally described in U.S. Pat. No. 
6,046,307. 

Candidate molecules that will be useful for the invention may also include 
small organic compounds that modulate telomerase activity. These compounds may 
be administered in combination with compounds that regulate Potlp activity. 
Alternatively, these compounds themselves are candidates for regulators of Potlp 
activity, and their possible effect on Potlp activity can be determined by the 
screening methods of the invention. These compounds are described in U.S. Pat. 
Nos. 6,194,206, 6,156,763, 6,110,955, or 6,054,442, for example. 

Methods to extend the lifespan of cells. 

The inventors have shown that chromosome of cells lacking Potlp activity are 
susceptible to rapid disorganization and destabilization. Potlp thus maintains 
telomere structure and function, which provides a means of therapeutic intervention 
in cases where it is desirable to alter telomere structure and function. Methods are 
provided alternatively to stabilize or to destabilize telomere structure, depending on 
the desirability of prolonging the proliferative capacity, or life-span, of the cell in 
question. "Proliferative capacity" and "life-span" both are used in this context in 
terms of how many times a cell can divide before it enters replicative senescence. 

Enhancing the activity of a Potl protein in a cell advantageously can stabilize 
telomeres and thereby prolong the life-span of the cell. Examples of suitable target 
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cells include those that are genetically engineered to produce a desired protein or 
those that produce useful antibodies. Other desirable target cell types include 
isolated stem cells, especially where disease otherwise would deplete various stem 
cell populations. Additional advantageous target cells include cells that proliferate in 
response to repeated tissue injury, such as endothelial cells, or cells whose functions 
are susceptible to aging or disease, such as CD4+ cells, connective tissue 
fibroblasts, or cells affected by age-related macular degeneration. 

Potlp activity can be increased in a number of ways in these desired target 
cells. In one method, Potlp activity is increased by transfecting the cell with an 
expression construct that encodes a Potl protein. In this embodiment, the "effector 
compound" is an expression vector that directs high level or regulated expression of 
a Potl polypeptide. The expression causes higher levels of Potlp to accumulate in 
the target cell, thereby increasing the overall level of Potlp activity or replacing 
Potlp lost through genetic mutation. In another method, the cell is treated with a 
small effector compound that stabilizes the interaction between Potlp and telomeric 
DNA. In either case, the effector compound may be added to a cell ex vivo to affect 
Potlp expression, followed by administration of the cell to the individual undergoing 
treatment. Alternatively, the effector compound may be administered to the cell in 
vivo. In this case a preferable means of administration directs or targets the effector 
compound to the desired cell. Suitable means of cell targeting are known in the art, 
and include liposome encapsulation and antibody-directed targeting, or combinations 
of these two. 

In some instances, it may be desirable to increase Potlp expression 
temporarily. When an effector compound is administered in vivo, this control 
typically can be achieved simply by discontinuing administration. Where Potlp 
expression is increased through recombinant engineering, on the other hand, it may 
be desirable to control Potlp expression with an inducible or regulated promoter. 
Expression then can be induced for as long as desired by administering the 
appropriate inducer or regulatory compound. 
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By contrast, an inhibitor of Potl protein function will be useful in shortening 
the life-span of cells, whose presence is undesirable, through the destabilization of 
telomere structure and function. Such cells include those that are immortalized by 
aberrant expression of telomerase, as in many cancer cell lines. Inhibitors may be 
delivered to the entire body, as is currently common in chemotherapeutic methods. 
Because Potlp is expressed in a variety of cell types in humans, and may be 
expressed ubiquitously, the amount of administered inhibitor must be carefully 
monitored to prevent adverse side-effects to other non-targeted cell types that express 
Potlp. As an alternative or supplement to whole-body delivery, localized delivery 
may be employed. For example, inhibitors can be formulated as a depot for internal 
delivery to the site of a tumor. In another embodiment, inhibitors may be targeted to 
a specific population of cells by one of the many available means of cell targeting, 
such as immunotargeting. 

Parasitic or pathogenic cells, e.g. yeast, whose proliferation or life-span may 
be controlled by regulating telomere length, also are desirable targets for Potlp 
inhibitors. Accordingly, one embodiment of the invention is a method of controlling 
yeast infection through administration of a therapeutically effective amount of a 
Potlp inhibitor. 

FIGURE 6 demonstrates the ability of Potlp to inhibit telomerase action. 
Potlp is believed to inhibit telomerase activity through the formation of a Potlp- 
telomeric DNA complex. Compounds which strengthen or weaken this complex thus 
are expected to affect the level of telomerase activity in a cell. In one embodiment of 
the invention, a method in which Potlp activity is increased in a cell, such as by 
recombinant expression of a POT1 polynucleotide, is combined with the 
administration of a compound that inhibits telomerase activity. A variety of 
telomerase inhibitors are known in the art, as described in U.S. Pat. No. 6,156,763, 
for example. 

Potl polypeptides. 
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The skilled artisan will appreciate that useful variants of a Potl protein 
include those that maintain the capability of binding single-stranded telomeric DNA. 
These variants will be useful, for example, in methods of screening for compounds 
that affect the ability of a Potl protein to interact with single-stranded DNA. Other 
useful protein variants may not exhibit DNA-binding activity, but may be useful for 
other purposes. Such purposes include raising antibodies that specifically bind a 
Potl protein, such as a non- functional, naturally occurring mutation of Potlp. Such 
purposes also include the identification of dominant negative inhibitors that bind 
other cellular proteins that normally interact with Potlp. Variants may occur 
naturally or may be created by modifying the primary sequence of the protein 
through manipulation of a polynucleotide encoding a Potl protein. "Protein" and 
"polypeptide" are used interchangeably throughout. 

"Variants" of an hPotl and SpPotl protein include naturally occurring allelic 
variations of hPotlp and SpPotl proteins, a fragment of a Potl protein that binds 
single-stranded telomeric DNA, or a fragment thereof that elicits an antigenic response 
when administered to a host animal. Variants also include polypeptides that have a 
modified amino acid sequence from the aforementioned polypeptides. Because protein 
function depends on three-dimension structure, skilled artisan will recognize that 
variants bearing the closest structural relationship to hPotlp and SpPotl p are most 
likely to preserve biological function. Sequence modifications include amino acid 
substitutions, insertions, and deletions. Amino acid insertions and deletions may be 
made in the interior of the protein sequence, as well as at the amino and carboxyl 
termini. Guidance in determining which and how many such sequence modifications 
may be made without abolishing biological or antigenic activity may be found using 
computer programs well known in the art, for example, DNAStar software. 

The sequence of variants preferably will have an 80% identity to the full- 
length hPotlp and SpPotl proteins. More preferably, variants will have at least about 
85% identity to the full-length sequences. Even more preferably, the percent identity 
will be at least about 90%, and most preferably, the percent identity will be at least 
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about 95%, or even 98%. Likewise, variants of fragments of hPotlp and SpPotl 
proteins will be useful for the invention, for instance, as antigenic fragments. Such 
variants will have at least about 85% identity to fragments of the hPotlp and SpPotl 
proteins. Even more preferably, the percent identity will be at least about 90%, and 
most preferably, the percent identity will be at least about 95%, or even 98%. 
Preferably, antigenic fragments will be 5, 10, 15, 20, or 30 amino acids in length. A 
preferred biologically active Potlp fragment folds into DNA-binding domain. 
Biologically active fragments include the N-terminal fragments of Potlp identified by 
gel shift assays, including the 22 kDa fragment of SpPotlp. 

Variants may also include "splicing variants." It is well-known that, within a 
given eukaryotic gene, sequences that encode the polypeptide gene product are non- 
contiguous. The protein coding sequences, or exons, are divided by intervening non- 
coding sequences, known as introns. These introns are transcribed but then spliced out 
during maturation of the mRNA. Exons often correspond to functional domains of the 
protein product. Go, Nature 291:90-92 (1981); Branden et ai, EMBO J. 3:1307-10 
(1984). 

Exons themselves may be spliced out during the maturation of the mRNA. In 
some cases, two exons may be mutually exclusive in the mature mRNA. Deletion or 
swapping of exons is known as alternative splicing. Andreadis et a/., Ann. Rev. Cell 
Biol 3:207-42 (1987). The family of proteins produced by alternatively spliced 
mRNAs exhibit different functional properties, depending on which exons are present 
in the mature mRNA. Typically, alternative splicing is regulated in a tissue-specific 
manner and involves only one or a few exons within a gene. 

Thus, the polynucleotides of the invention encompass variants that differ by the 
addition, deletion or alternative splicing of exons. In general, exons alternatively 
added to the 5' or 3' termini of the open reading frame are encompassed by "addition" 
variants, whereas alternatively spliced exons that contribute additional coding 
sequences within the open reading frame are encompassed by "insertion" variants. 
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Specific splicing variants encompassed by the invention are shown in the 
Figures. The SpPOTl gene, for example, has two introns, which normally are 
spliced from the mature transcript. However, in one splicing variant, intron 2 may 
not be spliced, so that it is included in the mature transcript (SEQ ID NO: 10). 
Because the intron does not contain a stop codon, the splicing variant mRNA gives 
rise to a somewhat larger polypeptide (compare SEQ ID NO; 9 and 11). When intron 
1 is not spliced out, however, the resulting protein is truncated as a result of a stop 
codon within intron 1. The resulting peptide has the sequence: (SEP ID NO:39) M 
GEDVIDSLQLNELLNAGEYKIGVRYQWI 
YICFANNEKGTYISVH. Alternatively, translational frame 
shifting may lead to a significantly larger protein product. Translational frame 
shifting has been observed in a number of proteins involved in telomere metabolism. 
Aigner et aL, EMBO /. 19: 6230-39, 2000. Polypeptides resulting from translational 
frame shifting also are considered "splicing variants" for the purposes of the 
invention. 

A more complex pattern of splicing variants is observed in hPOTl 
polynucleotides. In one splice variant, exon 5 is not incorporated into the mature 
transcript (see FIGURE 10G for nomenclature). The resulting polypeptide is 72 kDa 
in size and is shown in FIGURE 9B (SEQ ID NO: 13). When exon 5 is included in 
the mature transcript, the resulting protein is an N-terminal fragment that is 38 kDa 
in size, because of the presence of a stop codon within exon 5 (SEQ ID NO: 15). 
When the mature transcript lacks exons 5 and 10, it gives rise to another N-terminal 
fragment 58 kDa in size. Additional variants may arise from translational frame 
shifting, as well. 

Additional polypeptide sequences or other moieties, such as covalently attached 
detectable tags, may be added to the proteins of the invention. Additional polypeptide 
sequences may fused to either the amino or carboxyl termini of the polypeptides of the 
invention, and they may be useful, for example, in assisting the expression, 
purification, and/or detection proteins of the invention. For example, these various 
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sequences include those well known in the art that are useful in purification of 
recombinantly expressed proteins. A preferred fusion protein, which the inventors 
have reduced to practice, comprises a "His 6 tag** sequence, which facilitates 
purification of the recombinantly expressed protein. A preferred purification system 
is the TALON™ nondenaturing protein purification kit for purifying His 6 -tagged 
proteins under native conditions (CLONTECH, Palo Alto, CA). 

"Isolated" polypeptides of the invention have been purified to remove at least 
some portion of cellular or non-cellular molecules with which the proteins are 
associated naturally. Isolated proteins include those that are partially purified or 
enriched, as well as those purified to homogeneity. Isolated proteins also include 
those produced artificially, such as by recombinant expression or by in vitro 
translation. The isolated protein may be included in compositions containing other 
polypeptides for specific purposes, for example, as stabilizers. 

"Substitutions, insertions, additions and deletions" refer to changes in a 
particular polypeptide sequence, or any one its naturally occurring splicing variants. 
"Substitutions" generally refer to alterations in the amino acid sequence that do not 
change the overall length of the polypeptide, but only alter one or more amino acid 
residues, substituting one for another in the common sense of the word. Generally 
speaking, the number of amino acid substitutions for any given variant will not be 
more than about 20, 10, 5, or 3, such as 1-20 or any range or value therein. 
Substitutions preferably are conservative, such that one amino acid is replaced with 
one of similar shape and charge. Conservative substitutions are well known in the 
art and include, for example, the changes of: alanine to serine; arginine to lysine; 
asparigine to glutamine or histidine; aspartate to glutamate; cysteine to serine; 
glutamine to asparigine; glutamate to aspartate; glycine to proline; histidine to 
asparigine or glutamine; isoleucine to leucine or valine; leucine to valine or 
isoleucine; lysine to arginine, glutamine, or glutamate; methionine to leucine or 
isoleucine; phenylalanine to tyrosine, leucine or methionine; serine to threonine; 
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threonine to serine; tryptophan to tyrosine; tyrosine to tryptophan or phenylalanine; 
and valine to isoleucine or leucine. 

"Insertions" add extra amino acids to the interior (not the amino- or carboxyl- 
terminal ends) of the subject polypeptide. Insertions include amino acids encoded by 
exons that are alternatively spliced into a polypeptide, such as the splicing variants 
shown in FIGURES 8 and 9. "Deletions" diminish the overall size of the 
polypeptide by removal of amino acids from the interior or either end of the 
polypeptide. In one embodiment, deletions remove less than about 30% of the size 
of the subject molecule. Other preferred deletions include naturally occurring 
splicing variants of a Potl protein, such as those described above. These variants 
may be fragments of the size the full-length protein, which may be considerably 
smaller than 30% the size of the full-length protein. 

"Additions," like insertions, also add to the overall size of the protein; 
however, instead of being made within the molecule, they are made on the N- or C- 
terminus of the encoded protein. Unlike deletions, additions may be of virtually any 
size; however, preferred additions do not exceed about 100% of the size of the native 
molecule. "Additions" also to encompass adducts to the amino acids of the native 
molecule. 

In general, both the DNA and protein molecules of the invention can be defined 
with reference to "sequence identity." As used herein, "sequence identity" refers to a 
comparison made between two molecules using standard algorithms well-known in the 
art. Although any sequence algorithm can be used to define "sequence identity," for 
clarity, the present invention defines identity with reference to the Smith- Waterman 
algorithm, where the open reading frame generally is used as the reference sequence to 
define the percentage identity of polynucleotide homologues over its length. When 
"sequence identity" is used with reference to a polypeptide, the designated polypeptide 
is used as a reference sequence over its length. 

The choice of parameter values for matches, mismatches, and inserts or 
deletions is arbitrary, although some parameter values have been found to yield more 
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biologically realistic results than others. One preferred set of parameter values for the 
Smith-Waterman algorithm is set forth in the "maximum similarity segments" 
approach, which uses values of 1 for a matched residue and D for a mismatched residue 
(a residue being either a single nucleotide or a single amino acid). Insertions and 
deletions ("indels"), x, are weighted as: 

x k = 1 + k/3, 

where k is the number of residues in a given insert or deletion (Waterman, Bulletin of 
Mathematical Biology 46:473-500 (1984)). 

Polynucleotides of the invention. 

Polynucleotides of the invention are those that encode Potl proteins or their 
fragments and derivatives. These polynucleotides include those that encode SpPotl 
polypeptides. An S. pombe genomic DNA sequence is described by the Sanger 
Centre as part of cosmid clone c26H5, having accession number SPAC26H5 (SEQ 
ID NO: 7). This sequence contains an upstream promoter region, a coding region 
with two introns, and a downstream region that contains a terminator. Both 
upstream and downstream regions may play a role in the regulation of SpPotlp 
expression. The introns can be alternatively spliced, as described above (SEQ ID 
NOS:8 and 10). Preferred polynucleotides are non-genomic; i.e., they correspond to 
transcripts from genomic DNA. An example of non-genomic DNA is a mRNA or 
cDNA encoding the polypeptides of SEQ ID NO: 9 or SEQ ID NO: 1 1 . 

The polynucleotides of the invention also include those that encode a hPotlp 
and its variants and fragments. A partial genomic clone is described for human 
P077, having accession number AC004925 (SEQ ID NO: 18). This partial genomic 
clone contains nine exons, shown diagrammatically in FIGURE 10G. Of these 
exons, at least exons 5 and 10 can be alternatively spliced (compare SEQ ID 
NOS:12, 14, and 16). Various cDNA sequences encoding full-length hPotlp have 
been described: FLJ10368 (submitted 22 Feb. 2000), FLJ11073 (submitted 22 Feb. 
2000), FLJ12518 (submitted 29 Sept. 2000), BC002923 (submitted 5 Feb. 2001), 
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and NM_015450 (submitted 26 Feb. 2001). Various other partial cDNA sequences 
and ESTs that encode portions of hPotl protein also have been described: FLJ22851 
(submitted 29 Sept. 2000), AL050120 (submitted 18 Feb. 2000). Of the hPOTl 
polynucleotides presently described, only the hPOTl cDNA of SEQ ID NO: 12 
closely resembles the sequences described in FLJ10368, FLJ11073, and FLJ12518. 

The invention also provides a nucleic acid molecule having a sequence 
complementary to one of the above sequences. Such isolated nucleic acid molecules 
are useful as probes for gene mapping by in situ hybridization with chromosomes. 
They are particularly useful for detecting transcription of a POT1 gene in human 
tissue, or transcripts of naturally occurring homologues that may themselves be 
therapeutically useful. 

The polynucleotides of the invention may also be useful for detecting 
transcripts of naturally occurring POT1 variants occurring in disease states. The 
present polynucleotides thus may have diagnostic application in differentiating normal 
and abnormal genes, based on differential hybridization, as discussed in more detail 
below. Alternatively, a diagnostic application may include differentiating abnormally 
high or low levels of expression of a normal gene. 

Isolated nucleic acid molecules of the present invention include nucleic acid 
molecules comprising the coding sequence for a Potl protein, and nucleic acid 
molecules which comprise a nucleotide sequence substantially different from those 
described above but which, due to the degeneracy of the genetic code, still encode at 
least one Potl protein as described and enabled herein. Of course, the genetic code 
is well-known in the art. Thus, it would be routine for one skilled in the art to 
generate such degenerate nucleic acid variants that code for specific Potl proteins of 
the present invention. See, e.g., Ausubel, et al. 

The term "hybridization" refers to formation of double stranded 
polynucleotides through complementary nucleotide base pairing. High stringency 
hybridization occurs at a temperature between about 65 °C and 70 °C in a 
hybridization solution of 6X SSC, 0.5% SDS, 5X Denhardfs solution and 100|ig of 



25 



non-specific carrier DNA. The preferred probe is 100 bases selected from contiguous 
bases of the polynucleotide sequence set forth in SEQ ID NO:l. A high stringency 
wash solution contains the equivalent in ionic strength of less than about 0.2X SSC and 
0.1 % SDS, with a preferred stringent solution containing about 0.1X SSC and 0.1 % 
SDS. High stringency washing conditions comprise washing with 2X SSC with 0.05 
% SDS five times at room temperature, then washing with 0.1X SSC with 0.1 % 
SDS at 68 °C for 1 h. Blots containing the hybridized, labeled probe are exposed to 
film for one to three days. 

"Isolated" nucleic acid molecules are removed from their native or naturally 
occurring environment. For example, recombinant nucleic acid molecules in a 
vector and/or a host cell are considered isolated for the purposes of the present 
invention. Isolated RNA molecules include in vivo or in vitro RNA transcripts of the 
nucleic acid molecules of the present invention. Isolated nucleic acid molecules 
according to the present invention further include such molecules produced 
synthetically or purified from cells containing such nucleic acids, where the nucleic 
acid is in other than a naturally occurring form. Isolated nucleic acid molecules 
include genomic DNA that has been removed from the chromosome in which it 
occurs naturally. 

Vectors of the invention. 

The term "vector" refers to a nucleic acid compound used for introducing 
exogenous nucleic acid into host cells. A vector comprises a nucleotide sequence 
which may encode one or more polypeptide molecules. Plasmids, cosmids, viruses, 
and bacteriophages, in a natural state or which have undergone recombinant 
engineering, are non-limiting examples of commonly used vectors to provide 
recombinant vectors comprising at least one desired isolated nucleic acid molecule. 

The term "promoter" refers to a nucleic acid sequence that directs the 
initiation of transcription. An inducible promoter is one that is regulated by 
environmental signals, such as carbon source, heat, or metal ions. 
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"Host cell" refers to any eukaryotic, prokaryotic, or other cell that is suitable 
for propagating and/or expressing an isolated nucleic acid that is introduced into the 
host cell by any suitable means known in the art. The cell can be part of a tissue or 
organism, isolated in culture or in any other suitable form. 

The present invention further provides recombinant expression cassettes 
comprising a nucleic acid of the present invention, and operably linked to 
transcriptional initiation regulatory sequences that will direct the transcription of the 
polynucleotide in the intended host cell. Both heterologous and endogenous 
promoters can be employed to direct expression. These promoters can also be used, 
for example, in recombinant expression cassettes to drive expression of antisense 
nucleic acids to reduce Potlp content in a desired tissue. 

In some embodiments, isolated nucleic acids which serve as promoter or 
enhancer elements can be introduced in the appropriate position (generally upstream) of 
a non-heterologous form of a polynucleotide of the present invention so as to up or 
down regulate expression of a polynucleotide of the present invention. For example, 
endogenous promoters can be altered in vivo by mutation, deletion, and/or substitution. 
Suitable promoters include the phage lambda PL promoter, the E. coli lac, trp and 
tac promoters, the SV40 early and late promoters and promoters of retroviral LTRs, 
to name a few. Other suitable promoters will be known to the skilled artisan. The 
expression constructs will further contain sites for transcription initiation, termination 
and, in the transcribed region, a ribosome binding site for translation. The coding 
portion of the mature transcripts expressed by the constructs will preferably include a 
translation initiation codon at the beginning and a termination codon (UAA, UGA or 
UAG) appropriately positioned at the end of the polypeptide to be translated. 

The polynucleotides can optionally be joined to a vector containing a 
selectable marker for propagation in a host. Such markers include, e.g., 
dihydrofolate reductase or neomycin resistance for eukaryotic cell culture and 
tetracycline or ampicillin resistance genes for culturing in E. coli and other bacteria. 
Representative examples of appropriate hosts include, but are not limited to, bacterial 
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cells, such as E. coli, Streptomyces and Salmonella typhimurium cells; fungal cells, 
such as yeast cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; 
animal cells such as CHO, COS and Bowes melanoma cells; and plant cells. 
Appropriate culture mediums and conditions for the above-described host cells are 
known in the art. Among vectors preferred for use in bacteria include pQE70, 
pQE60 and pQE-9, available from Qiagen; pBS vectors, Phagescript vectors, 
Bluescript vectors, pNH8A, pNH16a, pNH18A, pNH46A, available from 
Stratagene; and ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available from 
Pharmacia. Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, 
pXTl and pSG available from Stratagene; and pSVK3, pBPV, pMSG and pSVL 
available from Pharmacia. Other suitable vectors will be readily apparent to the 
skilled artisan. Introduction of the construct into the host cell can be effected by 
calcium phosphate transfection, DEAE-dextran mediated transfection, cationic lipid- 
mediated transfection, electroporation, transduction, infection or other methods. 
Such methods are described in many standard laboratory manuals, such as Sambrook, 
supra, Chapters 1-4 and 16-18; Ausubel, supra, Chapters 1,9, 13, 15, 16. 

Recombinant protein expression. 

The polypeptide can be expressed in a modified form, such as a fusion 
protein, and can include not only secretion signals, but also additional heterologous 
functional regions. For instance, a region of additional amino acids, particularly 
charged amino acids, can be added to the N-terminus of a polypeptide to improve 
stability and persistence in the host cell, during purification, or during subsequent 
handling and storage. Also, peptide moieties can be added to a polypeptide to 
facilitate purification. Such regions can be removed prior to final preparation of a 
polypeptide. The addition of peptide moieties to polypeptides to engender secretion 
or excretion, to improve stability and to facilitate purification, among others, are 
familiar and routine techniques in the art. Such methods are described in many 
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standard laboratory manuals, such as Sambrook, supra, Chapters 17.29-17.42 and 
18.1-18.74; Ausubel, supra, Chapters 16, 17 and 18. 

A Potl polypeptide can be recovered and purified from recombinant cell 
cultures by well known methods. Polypeptides of the present invention include 
naturally purified products, products of chemical synthetic procedures, and products 
produced by recombinant techniques from a prokaryotic or eukaryotic host, 
including, for example, bacterial, yeast, higher plant, insect and mammalian cells. 
Polypeptides of the invention can also include an initial modified methionine residue, 
in some cases as a result of host-mediated processes. The monitoring of the 
purification process can be accomplished by DN A -binding activity assays, Western blot 
techniques, radioimmunoassay, or other standard immunoassay techniques. These 
methods are described in many standard laboratory manuals, such as Sambrook, 
supra, Chapters 17.37-17.42; Ausubel, supra, Chapters 10, 12, 13, 16, 18 and 20. 

Antibodies of the invention. 

Antibodies raised against the proteins and protein fragments of the invention 
also are contemplated by the invention. In particular, the invention contemplates 
antibodies raised against Potlp, and variants thereof. Described below are antibody 
products and methods for producing antibodies capable of specifically recognizing one 
or more epitopes of the presently described proteins and their derivatives. Antibodies 
include, but are not limited to polyclonal antibodies, monoclonal antibodies (mAbs), 
humanized or chimeric antibodies, single chain antibodies including single chain Fv 
(scFv) fragments, Fab fragments, F(ab') 2 fragments, fragments produced by a Fab 
expression library, anti-idiotypic (anti-Id) antibodies, epitope-binding fragments, and 
humanized forms of any of the above. 

As known to one in the art, these antibodies may be used, for example, in the 
detection of a target protein in a biological sample. They also may be utilized as part 
of treatment methods, and/or may be used as part of diagnostic techniques whereby 



29 



patients may be tested for abnormal levels or preferably for the presence of abnormal 
forms of the proteins. 

In general, techniques for preparing polyclonal and monoclonal antibodies as 
well as hybridomas capable of producing the desired antibody are well known in the art 
(Campbell, A.M., Monoclonal Antibody Technology: Laboratory Techniques in 
Biochemistry and Molecular Biology, Elsevier Science Publishers, Amsterdam, The 
Netherlands (1984); St. Groth et al., /. Immunol Methods 35:1-21 (1980); Kohler and 
Milstein, Nature 256:495-497 (1975)), the trioma technique, the human B-cell 
hybridoma technique (Kozbor et al., Immunology Today 4:72 (1983); Cole et al., in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc. (1985), pp. 77-96). 

i) Polyclonal antibodies. 

Polyclonal antibodies are heterogeneous populations of antibody molecules 
derived from the sera of animals immunized with an antigen, such as an inventive 
protein or an antigenic derivative thereof. Polyclonal antiserum, containing antibodies 
to heterogeneous epitopes of a single protein, can be prepared by immunizing suitable 
animals with the expressed protein described above, which can be unmodified or 
modified, as known in the art, to enhance immunogenicity. Immunization methods 
include subcutaneous or intraperitoneal injection of the polypeptide. 

Effective polyclonal antibody production is affected by many factors related 
both to the antigen and to the host species. For example, small molecules tend to be 
less immunogenic than others and may require the use of carriers and/or adjuvant. In 
addition, host animal response may vary with site of inoculation. Both inadequate or 
excessive doses of antigen may result in low titer antisera. In general, however, small 
doses (high ng to low jig levels) of antigen administered at multiple intradermal sites 
appears to be most reliable. Host animals may include but are not limited to rabbits, 
mice, and rats, to name but a few. An effective immunization protocol for rabbits can 
be found in Vaitukaitis, J. et al., J. Clin. Endocrinol. Metab. 33:988-991 (1971). 

The protein immunogen may be modified or administered in an adjuvant in 
order to increase the protein's antigenicity. Methods of increasing the antigenicity of a 
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protein are well known in the art and include, but are not limited to coupling the 
antigen with a heterologous protein or through the inclusion of an adjuvant during 
immunization. 

Booster injections can be given at regular intervals, with at least one usually 
being required for optimal antibody production. The antiserum may be harvested when 
the antibody titer begins to fall. Titer may be determined semi-quantitatively, for 
example, by double immunodiffusion in agar against known concentrations of the 
antigen. See, for example, Ouchterlony et al. t Chap. 19 in: Handbook of Experimental 
Immunology, Wier, ed, Blackwell (1973). Plateau concentration of antibody is usually 
in the range of 0.1 to 0.2 mg/ml of serum (about 12 jaM). The antiserum may be 
purified by affinity chromatography using the immobilized immunogen carried on a 
solid support. Such methods of affinity chromatography are well known in the art. 

Affinity of the antisera for the antigen may be determined by preparing 
competitive binding curves, as described, for example, by Fisher, Chap. 42 in: Manual 
of Clinical Immunology, second edition, Rose and Friedman, eds., Amer. Soc. For 
Microbiology, Washington, D.C. (1980). 

ii) Monoclonal antibodies. 

Monoclonal antibodies (MAbs), are homogeneous populations of antibodies to a 
particular antigen. They may be obtained by any technique that provides for the 
production of antibody molecules by continuous cell lines in culture or in vivo. MAbs 
may be produced by making hybridomas, which are immortalized cells capable of 
secreting a specific monoclonal antibody. 

Monoclonal antibodies to any of the proteins, peptides and epitopes thereof 
described herein can be prepared from murine hybridomas according to the classical 
method of Kohler, G. and Milstein, C, Nature 256:495-497 (1975) (and U.S. Patent 
No. 4,376,110) or modifications of the methods thereof, such as the human B-cell 
hybridoma technique (Kosbor et al., 1983, Immunology Today 4:72; Cole et aL, 1983, 
Proc. Natl Acad. Sci. USA 80: 2026-2030), and the EBV-hybridoma technique (Cole 
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et al, 1985, MONOCLONAL ANTIBODIES AND CANCER THERAPY, Alan R. 
Liss, Inc., pp. 77-96). 

In one method a mouse is repetitively inoculated with a few micrograms of the 
selected protein over a period of a few weeks. The mouse is then sacrificed, and the 
antibody producing cells of the spleen are isolated. The spleen cells are fused, 
typically using polyethylene glycol, with mouse myeloma cells, such as SP2/0-Agl4 
myeloma cells. The excess, unfused cells are destroyed by growth of the system on 
selective media comprising aminopterin (HAT media). The successfully fused cells are 
diluted, and aliquots are plated to microliter plates where growth is continued. 
Antibody-producing clones (hybridomas) are identified by detection of antibody in the 
supernatant fluid of the wells by immunoassay procedures. These include ELISA, as 
originally described by Engvall, Meth. EnzymoL 70:419 (1980), western blot analysis, 
radioimmunoassay (Lutz et aL, Exp. Cell Res. 175:109-124 (1988)) and modified 
methods thereof. 

Selected positive clones can be expanded and their monoclonal antibody product 
harvested for use. Detailed procedures for monoclonal antibody production are 
described in Davis, L. et al BASIC METHODS IN MOLECULAR BIOLOGY, 
Elsevier, New York. Section 21-2 (1989). The hybridoma clones may be cultivated in 
vitro or in vivo, for instance as ascites. Production of high titers of mAbs in vivo 
makes this the presently preferred method of production. Alternatively, hybridoma 
culture in hollow fiber bioreactors provides a continuous high yield source of 
monoclonal antibodies. 

The antibody class and subclass may be determined using procedures known in 
the art (Campbell, A.M., Monoclonal Antibody Technology: Laboratory Techniques in 
Biochemistry and Molecular Biology , Elsevier Science Publishers, Amsterdam, The 
Netherlands (1984)). MAbs may be of any immunoglobulin class including IgG, IgM, 
IgE, IgA, IgD and any subclass thereof. Methods of purifying monoclonal antibodies 
are well known in the art. 

Hi) Antibody derivatives and fragments. 
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Fragments or derivatives of antibodies include any portion of the antibody 
which is capable of binding the target antigen, or a specific portion thereof. Antibody 
fragments specifically include F(ab') 2 , Fab, Fab* and Fv fragments. These can be 
generated from any class of antibody, but typically are made from IgG or IgM. They 
may be made by conventional recombinant DNA techniques or, using the classical 
method, by proteolytic digestion with papain or pepsin. See CURRENT 
PROTOCOLS IN IMMUNOLOGY, chapter 2, Coligan et al., eds., (John Wiley & 
Sons 1991-92). 

F(ab') 2 fragments are typically about 110 kDa (IgG) or about 150 kDa (IgM) 
and contain two antigen-binding regions, joined at the hinge by disulfide bond(s). 
Virtually all, if not all, of the Fc is absent in these fragments. Fab' fragments are 
typically about 55 kDa (IgG) or about 75 kDa (IgM) and can be formed, for example, 
by reducing the disulfide bond(s) of an F(ab') 2 fragment. The resulting free sulfhydryl 
group(s) may be used to conveniently conjugate Fab' fragments to other molecules, 
such as detection reagents (e.g., enzymes). 

Fab fragments are monovalent and usually are about 50 kDa (from any source). 
Fab fragments include the light (L) and heavy (H) chain, variable (V L and V H , 
respectively) and constant (C L and C H , respectively) regions of the antigen-binding 
portion of the antibody. The H and L portions are linked by an intramolecular 
disulfide bridge. 

Fv fragments are typically about 25 kDa (regardless of source) and contain the 
variable regions of both the light and heavy chains (V L and V H , respectively). Usually, 
the V L and V H chains are held together only by non-covalent interacts and, thus, they 
readily dissociate; however, they have the advantage of small size and they retain the 
same binding properties of the larger Fab fragments. Accordingly, methods have been 
developed to crosslink the V L and V H chains, using, for example, glutaraldehyde (or 
other chemical crosslinkers), intermolecular disulfide bonds (by incorporation of 
cysteines) and peptide linkers. 
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Other antibody derivatives include single chain antibodies (U.S. Patent 
4,946,778; Bird, Science 242:423-426 (1988); Huston et al, Proc. Natl. Acad. Sci. 
USA 85:5879-5883 (1988); and Ward et al, Nature 334:544-546 (1989)). Single 
chain antibodies are formed by linking the heavy and light chain fragments of the Fv 
region via an amino acid bridge, resulting in a single chain Fv (SCFv). 

One preferred method involves the generation of scFvs by recombinant 
methods, which allows the generation of Fvs with new specificities by mixing and 
matching variable chains from different antibody sources. In a typical method, a 
recombinant vector would be provided which comprises the appropriate regulatory 
elements driving expression of a cassette region. The cassette region would contain a 
DNA encoding a peptide linker, with convenient sites at both the 5' and 3* ends of the 
linker for generating fusion proteins. The DNA encoding a variable region(s) of 
interest may be cloned in the vector to form fusion proteins with the linker, thus 
generating a scFv. 

In an exemplary alternative approach, DNAs encoding two Fvs may be ligated 
to the DNA encoding the linker, and the resulting tripartite fusion may be ligated 
directly into a conventional expression vector. The scFv DNAs generated any of these 
methods may be expressed in prokaryotic or eukaryotic cells, depending on the vector 
chosen. 

Antibody fragments that recognize specific epitopes may be generated by 
known techniques. For example, such fragments include but are not limited to: the 
F(ab') 2 fragments which can be produced by pepsin digestion of the antibody molecule 
and the Fab fragments which can be generated by reducing the disulfide bridges of the 
F(ab) 2 fragments. Alternatively, Fab expression libraries may be constructed (Huse et 
al., 1989, Science, 246:1275-1281) to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity. 

Derivatives also include "chimeric antibodies" (Morrison et al., Proc. Natl. 
Acad. Sci., 81:6851-6855 (1984); Neuberger et al, Nature, 312:604-608 (1984); 
Takeda et al, Nature, 314:452-454 (1985)). These chimeras are made by splicing the 
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DNA encoding a mouse antibody molecule of appropriate specificity with, for instance, 
DNA encoding a human antibody molecule of appropriate specificity. Thus, a 
chimeric antibody is a molecule in which different portions are derived from different 
animal species, such as those having a variable region derived from a murine mAb and 
a human immunoglobulin constant region. These are also known sometimes as 
"humanized" antibodies and they offer the added advantage of at least partial shielding 
from the human immune system. They are, therefore, particularly useful in therapeutic 
in vivo applications. 

iv) Labeled antibodies. 

The present invention further provides the above-described antibodies in 
detectably labeled form. Antibodies can be detectably labeled through the use of 
radioisotopes, affinity labels (such as biotin, avidin, etc.), enzymatic labels (such as 
horseradish peroxidase, alkaline phosphatase, etc.), fluorescent labels (such as FITC or 
rhodamine, etc.), paramagnetic atoms, etc. Procedures for accomplishing such 
labeling are well-known in the art, for example see (Sternberger et al, J. Histochem. 
Cytochem. 18:315 (1970); Bayer et al, Meth. Enzym. 62:308 (1979); Engval et al, 
Immunol 109:129 (1972); Goding, J. Immunol Meth. 13:215 (1976)). The labeled 
antibodies of the present invention can be used for in vitro, in vivo, and in situ 
diagnostic assays. 

v) Immobilized antibodies. 

The foregoing antibodies also may be immobilized on a solid support. 
Examples of such solid supports include plastics such as polycarbonate, complex 
carbohydrates such as agarose and sepharose, acrylic resins and such as polyacrylamide 
and latex beads. Techniques for coupling antibodies to such solid supports are well 
known in the art (Weir et al, "Handbook of Experimental Immunology" 4th Ed., 
Blackwell Scientific Publications, Oxford, England, Chapter 10 (1986); Jacoby et al, 
Meth. Enzym. 34 Academic Press, N.Y. (1974)). The immobilized antibodies of the 
present invention can be used for in vitro, in vivo, and in situ assays as well as for 
immunoaffinity purification of the proteins of the present invention. 
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Pharmaceutical compositions comprising a P0T1 gene. 

Pharmaceutical compositions comprising polynucleotides encoding functional 
Potl polypeptides of the invention are those useful for gene therapy to cause the 
overexpression of functional Potl polypeptides in cells in which chromosome 
stabilization is desired, or the overexpression of a variant Potl polypeptide with 
dominant negative interference activity in cells in which chromosome destabilization is 
desired. 

Overexpression of POT1 in a cell may be accomplished by transfecting a cell 
with a POT1 polynucleotide. The POT1 polynucleotide generally is a component on an 
expression vector of the invention, defined above. The vector may be delivered to a 
cell by transfection of a cell ex vivo, followed by selection and cloning of transfected 
cells expressing the POT1 nucleotide and then by administration of the stably 
transfected cells to an individual in need of the modified cells. 

Alternatively, the POT1 polynucleotide may be delivered to a cell or a 
population of cells in an individual. Various methods of introducing exogenous genes 
into cells in vivo are known in the art. See Rosenberg et ai y Science 242:1575-1578 
(1988) and Wolff et al. 9 PNAS 86:9011-9014 (1989), which are incorporated herein by 
reference. A listing of suitable vectors is set forth in Hodgson, Bio/Technology 13: 
222 (1995), which is incorporated by reference. One example of a suitable vector is a 
cationic liposome, such as DC-Chol/DOPE liposome, which is an appropriate vehicle 
to deliver DNA to a wide range of tissues through intravenous injection of 
DNA/cationic liposome complexes. See Caplen et ai, Nature Med. 1:39-46 (1995) 
and Zhu et ai , Science 267:209-21 1 (1993), herein incorporated by reference. 

Viral vector-mediated gene transfer is also a suitable method for the 
introduction of the vector into a target cell. Appropriate viral vectors include 
adenovirus vectors and adeno-associated virus vectors, retrovirus vectors and 
herpesvirus vectors. Adenoviruses are linear, double stranded DNA viruses complexed 
with core proteins and surrounded by capsid proteins. The common serotypes 2 and 5, 
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which are not associated with any human malignancies, are typically the base vectors. 
By deleting parts of the virus genome and inserting the desired gene under the control 
of a constitutive viral promoter, the virus becomes a replication-deficient vector 
capable of transferring the exogenous DNA to differentiated, non-proliferating cells. 
To enter cells, the adenovirus interacts with specific receptors on the cell surface, and 
the adenovirus surface proteins interact with the cell surface integrins. The virus 
penton-cell integrin interaction provides the signal that brings the exogenous gene- 
containing virus into a cytoplasmic endosome. The adenovirus breaks out of the 
endosome and moves to the nucleus, the viral capsid falls apart, and the exogenous 
DNA enters the cell nucleus where it functions, in an epichromosomal fashion, to _ . 
express the exogenous gene. Detailed discussions of the use of adenoviral vectors for 
gene therapy can be found in Berkner, Biotechniques 6:616-629 (1988) and Trapnell, 
Advanced Drug Delivery Rev. 72:185-199 (1993), which are herein incorporated by 
reference. Adenovirus-derived vectors, particularly non-replicative adenovirus vectors, 
are characterized by their ability to accommodate exogenous DNA of 7.5 kB, relative 
stability, wide host range, low pathogenicity in man, and high titers (10 4 to 10 5 plaque 
forming units per cell). See Stratford-Perricaudet et al , PNAS 59:2581 (1992). 

Pharmaceutical compositions may be formulated with one or more 
physiologically acceptable carriers or excipients. In one embodiment, the composition 
is formulated for injection. Long acting formulations are generally known in the art and 
can be adapted to the administration of a POT1 polynucleotide. Such compositions 
may be in the form of suspensions, solutions, emulsions in vesicles, or any other form 
known in the art. Additional suspending, stabilizing, or dispersing agents may be 
added as necessary. Alternatively, the active ingredient may be in the form of a 
powder for reconstitution prior to administration. 

Diagnostic methods. 

The present invention also contemplates methods for diagnosis of human 
disease. In particular, patients can be screened for the occurrence of cancers, or 
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likelihood of occurrence of cancers, associated with mutations in the Potl protein or 
with changes in its level of expression. By examining a number of patients in this 
manner, mutations in the gene that are associated with a malignant cellular phenotype 
can be identified. In addition, correlation of the nature of the observed mutations with 
subsequent observed clinical outcomes allows development of prognostic model for the 
predicted outcome in a particular patient. 

Screening for mutations conveniently can be carried out at the DNA level by 
use of PCR, although the skilled artisan will be aware that many other well known 
methods are available for the screening. PCR primers can be selected that flank known 
mutation sites, and the PCR products can be sequenced to detect the occurrence of the 
mutation. Alternatively, the 3' residue of one PCR primer can be selected to be a 
match only for the residue found in the unmutated gene. If the gene is mutated, there 
will be a mismatch at the 3' end of the primer, and primer extension cannot occur, and 
no PCR product will be obtained. Alternatively, primer mixtures can be used where 
the 3' residue of one primer is any nucleotide other than the nonmutated residue. 
Observation of a PCR product then indicates that a mutation has occurred. Other 
methods of using, for example, oligonucleotide probes to screen for mutations are 
described, or example, in U.S. Patent No. 4,871,838, which is herein incorporated by 
reference in its entirety. 

Alternatively, antibodies can be generated that selectively bind either mutated or 
non-mutated Potl protein. The antibodies then can be used to screen tissue samples for 
occurrence of mutations in a manner analogous to the DNA-based methods described 
above. 

The diagnostic methods described above can be used not only for diagnosis and 
for prognosis of existing disease, but may also be used to predict the likelihood of the 
future occurrence of disease. For example, clinically healthy patients can be screened 
for mutations in the Potl protein that correlate with later disease onset. Such mutations 
may be observed in the heterozygous state in healthy individuals. In such cases a 
single mutation event can effectively disable proper functioning of the gene encoding 
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the Potl protein and induce a transformed or malignant phenotype. This screening also 
may be carried out prenatally or neonatally. 

DNA molecules according to the invention also are well suited for use in so- 
called "gene chip" diagnostic applications. Such applications have been developed by, 
inter alia, Synteni and Asymetrix. Briefly, all or part of the DNA molecules of the 
invention can be used either as a probe to screen a polynucleotide array on a "gene 
chip," or they may be immobilized on the chip itself and used to identify other 
polynucleotides via hybridization to the surface of the chip. In this manner, for 
example, related genes can be identified, or expression patterns of the POT] gene in 
various tissues can be simultaneously studied. Such gene chips have particular 
application for diagnosis of disease, or predisposition to disease, which may be 
indicated by a change in the level or tissue distribution of POT] mRNA or by the 
presence of a particular POT I mRNA species. Suitable chip technology is described 
for example, in Wodicka et aL, Nature Biotechnology, 15:1359 (1997) which is hereby 
incorporated by reference. 

Detection of a Potl polypeptide. 

The presence of a Potl protein may be assayed in a biological sample isolated 
from an individual. Potlp may be detected in any number of ways commonly known 
in the art. For example, Potlp may be detected by a specific interaction with a 
labeled antibody of the invention. The antibody label allows rapid detection of an 
immune complex by such well known methods as Western blotting. Formation of an 
immune complex will be useful in detecting Potl proteins with or without biological 
function. Thus, an immune complex formation will be the preferred mode of 
detection of a Potl protein in a sample from an individual, where the Potl protein in 
the sample is suspected of lacking activity through genetic alteration. Such an assay 
thus will be useful in a diagnostic method, to detect altered forms of Potlp. 

Alternatively, a Potl protein may be assayed by virtue of its biological 
function. In one embodiment, a sample suspected of containing a Potl polypeptide 
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is exposed to isolated labeled telomeric DNA. A Potl protein is then detected by its 
ability to interact with the telomeric DNA. A convenient method of assaying this 
interaction is with a gel shift assay, which is well known in the art and used to form 
the Potlp-DNA complexes in Example 2. 

Pharmaceutical compositions comprising compounds that affect Potlp activity, and 
routes of administering the same. 

Pharmaceutical compositions comprising compounds that affect Potl protein 
activity can be formulated and administered according to well known methods. These 
compounds include those small molecule compounds that affect Potlp binding to 
telomeric DNA identified by the screening methods of the invention. These 
compounds may be delivered in a pharmaceutically acceptable carrier vehicle. Suitable 
vehicles and their formulation are described, for example, in Remington's 
Pharmaceutical Sciences (16th ed., Osol, A., Ed., Mack, Easton PA (1980)). 

Pharmaceutical compositions are formulated to provide a "therapeutically 
effective amount" of a compound that affects the activity of a Potl protein. The 
amount of a compound required for therapeutic efficacy depends on the individual or 
animal to be treated, and on the precise condition involving a Potl protein. The 
amount actually administered will be optimized to reduce side-effects while having a 
maximum effect on the activity of a Potl protein. Preferably, the amount delivered 
to the body will be reduced by directed delivery to a population of target cells, where 
possible. 

Pharmaceutical compositions for use in accordance with the present invention 
may be formulated in conventional manner using one or more physiologically 
acceptable carriers or excipients. Thus, the compounds and their physiologically 
acceptable salts may be formulated for administration by a variety of routes. The 
compounds may be delivered by parenteral, inhalation or insufflation (either through 
the mouth or the nose), topical, oral, or depot administration. 
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The compounds may be formulated for parenteral administration by injection, 
e.g., by bolus injection, repeated injections, or continuous infusion. Formulations for 
injection may be presented in unit dosage form, e.g., in ampules or in multi-dose 
containers, with an added preservative. The compositions may take such forms as 
suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain agents 
that aid in suspending, stabilizing or dispersing the active compounds. Alternatively, 
the active ingredient may be in powder form for constitution with a suitable vehicle, 
e.g., sterile pyrogen-free water, before use. Instead of injection, the compounds may 
be administered as an irrigation fluid used to wash areas or organs of the body. 

For administration by inhalation, the compounds for use according to the 
present invention are conveniently delivered in the form of an aerosol spray 
presentation from pressurized packs or a nebuliser, with the use of a suitable 
propellant, e.g., dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoro- 
ethane, carbon dioxide or other suitable gas. In the case of a pressurized aerosol the 
dosage unit may be determined by providing a valve to deliver a metered amount. 
Capsules and cartridges of, e.g. gelatin for use in an inhaler or insufflator may be 
formulated containing a powder mix of the compound and a suitable powder base such 
as lactose or starch. 

In addition to the formulations described previously, the compounds may also 
be formulated as a depot preparation. Such long acting formulations may be 
administered by implantation (for example subcutaneously or intramuscularly) or by 
intramuscular injection. The compounds may also be formulated in rectal compositions 
such as suppositories or retention enemas, e.g., containing conventional suppository 
bases such as cocoa butter or other glycerides. 

For oral administration, the pharmaceutical compositions may take the form of, 
for example, tablets or capsules prepared by conventional means with pharmaceutical^ 
acceptable excipients such as binding agents (e.g., pregelatinised maize starch, 
polyvinylpyrrolidone or hydroxypropyl methylcellulose); fillers (e.g., lactose, 
microcrystalline cellulose or calcium hydrogen phosphate); lubricants (e.g., magnesium 
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stearate, talc or silica); disintegrants (e.g., potato starch or sodium starch glycolate); or 
wetting agents (e.g., sodium lauryl sulphate). The tablets may be coated or formulated 
for sustained release by methods well known in the art. Liquid preparations for oral 
administration may take the form of, for example, solutions, syrups or suspensions, or 
they maybe presented as a dry product for constitution with water or other suitable 
vehicle before use. Such liquid preparations may be prepared by conventional means 
with pharmaceutically acceptable additives such as suspending agents (e.g., sorbitol 
syrup, cellulose derivatives or hydrogenated edible fats); emulsifying agents (e.g., 
lecithin or acacia); non-aqueous vehicles (e.g., almond oil, oily esters, ethyl alcohol or 
fractionated vegetable oils); and preservatives (e.g., methyl or propyl-p- 
hydroxybenzoates or sorbic acid). The preparations may also contain buffer salts, 
flavoring, coloring and sweetening agents as appropriate. 

Preferred formulations for oral delivery are described by US Pat. Nos. 
5,574,018 and 5,428,023. Biologically active conjugates of a therapeutically useful 
protein are made with vitamin B 12 (VB l2 ) by covalently binding the primary (5') 
hydroxy 1 group of the ribose moiety of VB 12 to the therapeutic protein. When the 
resulting conjugate is orally delivered, it binds intrinsic factor (IF) transporter 
protein in the gastrointestinal tract and is then taken up through the epithelium into 
the bloodstream, retaining the biological activity of the protein therapeutic. The 
conjugates may be orally administered in the presence of purified IF, resulting in 
greater absorption. 

WO 93/25221 describes compositions formulated for oral delivery, 
comprising therapeutic proteins contained in microspheres made of protein and/or 
synthetic polymer. The microspheres protect their protein contents ^against 
gastrointestinal proteases and provide controlled and sustained release of their 
contents. Microspheres can be designed to pass through the intestinal epithelium into 
the blood or lymph, and they may be targeted to particular cells or organs. 
Formulations and methodology useful for targeting orally administered microparticles 
to various organs are described in EP 531,497, for example. 
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The present invention, thus generally described, will be understood more 
readily by reference to the following examples, which are provided by way of 
illustration and are not intended to be limiting of the present invention. 

EXAMPLES 
Example 1: Expression and purification of SpPotlp. 

SpPotlp containing N-terminal V5 and His 6 -tags was cloned into the pQE30 
expression vector (Qiagen), which introduces an additional N-terminal His 6 -tag, and 
expressed in E. coli strain M15 (pRep4) using tryptone phosphate media. Following 
induction (0.8 mM IPTG) for 6 hours at 24°C cells were harvested, resuspended in 
lysis buffer at pH 8.0 (50 mM NaH 2 P0 4 ; 0.1 M NaCl; 2 mM imidazole; 10 % 
glycerol; 0.2% Tween20; 5 mM p-mercaptoethanol, 1 mM PMSF) and lysed by the 
addition of lysozyme (0.5 mg/ml). After 30 min the concentration of NaCl was 
increased to 0.6 M, genomic DNA was sheared by sonication and cell debris was 
removed by centrifugation at 10,000 g for 30 min. The supernatant was incubated 
with Ni-NTA resin (Qiagen) at 4°C for 90 min, which was then loaded onto a 
column and washed sequentially with P buffer (50 mM NaH 2 P0 4 ; 600 mM NaCl; 10 
% glycerol; 0.2% Tween20; 5 mM P-mercaptoethanol) containing increasing 
concentrations of imidazole. Potlp eluted around 90 mM imidazole. Potl containing 
fractions were dialysed against T buffer (50 mM Tris/HCl pH 8.0; 10% glycerol; 0.5 
mM EDTA; 0.5 mM DTT) containing 0.2 M KC1 and Potlp was further purified on 
a Q-sepharose column (Pharmacia) using a linear gradient of KC1 (0.2 M - 1 M). 
Potlp eluted around 0.5M KC1, was dialysed against T buffer plus 0.2 M KC1 and 
stored in aliquots at -80°C. 

Example 2: DNA-binding specificity of SpPotlp. 

C-strand (CGTAACCGTAACCCTGTAACCTGTAACCTGTAACCGTG- 
TAACC) (SEP ID NO: 40) and G-strand (GGTTACACGGTTACAGGTTACAGGT 
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TACAGGGTTACGGTTACG) (SEP ID NO: 28) were 5' 32 P-labeled using T4 
polynucleotide kinase and y- 32 P-ATP. Duplex DNA was generated by annealing 
equimolar amounts of radiolabeled C-strand and unlabelled G-strand. Binding 
reactions (10 nl) were carried out in 25 mM HEPES (pH 7.5), 1 mM EDTA, 50 mM 
NaCl, 5% glycerol, and 2.5 jiM PBolil09 (CCGTAAGCATTTCATTATTGGAATT 
CGAGCTCGTTTTCGA) (SEP ID NG:41) as non-specific competitor. Potlp (50 
ng) was incubated with the indicated DNA substrates (1 ng) for 15 min at 20°C. 
Complexes were analyzed by electrophoresis at 4°C through a 4-20% TBE gel 
(Invitrogen) run at 150 V for 80 min. The Potlp-DNA complex is indicated by an 
open arrow in FIGURE 3A. FIGURE 3B shows the same experiment except that the 
added protein (100 ng) contained truncated Potlp as well as full length protein. 
Truncated Potlp-DNA complex is indicated by a closed arrow. 

Example 3: Substrate specificity ofSpPotlp and hPotlp. 

FIGURE 6A shows binding of SpPotlp to radiolabeled 5. pombe and human 
G-strand DNAs. FIGURE 6B shows binding of SpPotlp (50 ng) to radiolabeled G- 
strand (15 pg or 1.5 fmol) in the presence of 10-, 100-, and 1000-fold excess of 
unlabeled 5. pombe, human or O. nova G-strand. FIGURE 6C shows binding of 
hPotlp to radiolabeled S. pombe and human G-strand DNAs. FIGURE 6D shows 
binding of hPotlp to radiolabeled human G-strand DNA under same conditions as in 
FIGURE 6B. 

Example 4: Cloning of the hPOTl gene. 

Gligos PBolil64T (SEP ID NP:42) (TTCAGATGTTATCTGTCAATCAG 
AACCTG) and PBolil94B (GAACACTGTTTACATCCATAGTGATGTATTGTT 
CC) were used to amplify a 614 bp fragment of hPOTl from multiple tissue cDNA 
panels (Clontech) with Advantage 2 Polymerase mix in the buffer supplied by 
Clontech. Cycling parameters of touch-down PCR were 94°C for 5 s, 68°C for 120 
s (32 cycles). The gene encoding glyceraldehyde phosphate dehydrogenase (GAPDH) 
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was used as a positive control for the integrity of the cDNA sample and was 
amplified for 26 cycles with primers (SEP ID NO:44) TGAAGGTCGGAGT- 
CAACGGATTTGGT and (SEP ID NP:45) CATGTGGGCCATGAGGTC- 
CACCAC. 

hPOTl was PCR amplified from ovary cDNA and cloned into a pQE30 
expression vector. Recombinantly expressed hPotlp (carrying an N-terminal His 6 - 
tag) was purified from E. coli. The protein was purified over Ni-NTA resin under 
the same conditions as SpPotlp. The human protein eluted at around 135 mM 
imidazole. 

The description, specific examples, and data, while indicating exemplary 
embodiments, are given by way of illustration and are not intended to limit the 
present invention. Various changes and modifications within the present invention 
will become apparent to the skilled artisan from the disclosure, and thus are 
considered part of the invention. 
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What is claimed is: 

1. A method of identifying a compound that regulates the binding of Potl 
to telomeric DNA, comprising detecting whether a candidate compound regulates the 
binding of a Potl polypeptide to a single-stranded telomeric DNA. 

2. The method of Claim 1 , wherein the candidate compound is exposed 
to a Potl polypeptide-telomeric DNA complex. 

3. The method of Claim 1, wherein the candidate compound is exposed 
to the Potl polypeptide prior to exposure to the telomeric DNA. 

4. The method of Claim 1, wherein the step of detecting comprises 
detecting whether the candidate compound strengthens the interaction between the 
Potl polypeptide and the telomeric DNA. 

5. The method of Claim 1, wherein the step of detecting comprises 
detecting whether the candidate compound stabilizes the interaction between the Potl 
polypeptide and the telomeric DNA. 

6. The method of Claim 1, wherein the step of detecting comprises 
detecting whether the candidate compound weakens the interaction between the Potl 
polypeptide and the telomeric DNA. 

7. The method of Claim 1, wherein the step of detecting comprises 
detecting whether the candidate compound disrupts the interaction between the Potl 
polypeptide and the telomeric DNA. 

8. The method of Claim 1, wherein the step of detecting comprises 
detecting whether the candidate compound interacts with the Potl polypeptide or a 
complex between the Potl polypeptide and the telomeric DNA to change the binding 
constant of the complex between the Potl polypeptide and the telomeric DNA. 

9. The method of Claim 1, wherein the step of detecting is performed by 
detecting the ability of the candidate compound to change the amount of a labeled 
probe comprising a fragment of single-stranded telomeric DNA that interacts with the 
Potl polypeptide. 
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10. The method of Claim 1, wherein the step of detecting is performed 
using an electrophoretic mobility shift assay. 

1 1 . The method of Claim 1 , wherein the step of detecting is performed 
using a high throughput assay for screening candidate compounds simultaneously. 

12. The method of Claim 1, wherein the step of detecting is performed 
using an isolated cell that recombinantly expresses the Potl polypeptide. 

13. The method of Claim 1, further comprising testing candidate 
compounds that regulate the binding of a Potl polypeptide to single-stranded 
telomeric DNA to determine whether the candidate compounds regulate telomere 
length or integrity throughout repeated divisions in a cell culture system. 

14. The method of Claim 1, wherein the Potl polypeptide is selected from 
the group consisting of: 

a) a Potl polypeptide comprising an amino acid sequence selected from 
the group consisting of: SEQ ID NO:5, SEQ ID NO: 13, SEQ ID NO: 15 and SEQ ID 
NO:17. 

b) a Potl polypeptide comprising an amino acid sequence that is at least 
about 85% identical to an amino acid sequence selected from the group consisting of: 
SEQ ID NO:5, SEQ ID NO: 13, SEQ ID NO: 15 and SEQ ID NO: 17, wherein the 
polypeptide binds single-stranded telomeric DNA; and 

c) a fragment of a Potl polypeptide as set forth in (a) or (b), wherein the 
fragment binds single-stranded telomeric DNA. 

15. The method of Claim 1, wherein the Potl polypeptide comprises an 
amino acid sequence that is at least about 90% identical to an amino acid sequence 
selected from the group consisting of: SEQ ID NO:5, SEQ ID NO: 13, SEQ ID 
NO: 15 and SEQ ID NO: 17, wherein the polypeptide binds single-stranded telomeric 
DNA. 

16. The method of Claim 1, wherein the Potl polypeptide is a fragment of 
an amino acid sequence selected from the group consisting of: SEQ ID NO:5, SEQ 
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ID NO: 13, SEQ ID NO: 15 and SEQ ID NO: 17, wherein the fragment binds single- 
stranded telomeric DNA. 

17. The method of Claim 1, wherein the Potl polypeptide comprises an 
amino acid sequence selected from the group consisting of: SEQ ID NO: 5, SEQ ID 
NO: 13, SEQ ID NO: 15 and SEQ ID NO: 17. 

18. The method of Claim 1, wherein the Potl polypeptide comprises SEQ 
ID NO:5. 

19. The method of Claim 1, wherein the single-stranded telomeric DNA is 

G-rich. 

20. The method of Claim 1, wherein the single-stranded telomeric DNA 
comprises TTAGGG (positions 1-6 of SEQ ID NO:20) repeats. 

21. The method of Claim 1, wherein the single-stranded telomeric DNA 
comprises a nucleic acid sequence selected from the group consisting of any one of 
SEQ ID NOs:36-38. 

22. The method of Claim 1, wherein the candidate compound is selected 
from the group consisting of: a small organic molecule, an oligonucleotide, and a 
non-hydrolyzable DNA analogue. 

23. A method of identifying a compound that interferes with the binding of 
a Potl polypeptide to a single-stranded telomeric DNA, comprising determining 
whether the candidate compound decreases the binding of the Potl polypeptide to a 
single-stranded telomeric DNA molecule in a mixture comprising the single-stranded 
telomeric DNA molecule, the polypeptide, and the candidate compound. 
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ABSTRACT OF THE DISCLOSURE 

A protein identified in humans and Schizosaccharomyces pombe, Potlp, 
binds single-stranded telomeric DNA and both stabilizes chromosome ends and 
regulates telomerase activity. Compounds that stabilize or disrupt the Potlp-DNA 
interaction will be useful in regulating the telomere length of a cell. Because 
telomere length is involved in the regulation of cellular life-span, the life-span of 
useful cell populations may be prolonged or undesirable cells may be caused to cease 
proliferation. The identification of a Potl protein and its encoding DNA provides 
methods of screening useful compounds or diagnosing illnesses that involve altered 
expression or structure of a Potl protein or gene. 
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